The Syllabus Based Web Content Extractor (SBWCE)

Authors

  • Saba Hilal
  • S. A. M. Rizvi
Abstract

The Syllabus Based Web Content Extractor (SBWCE) introduces a new technique for Syllabus Based Web Content Mining. It simplifies Syllabus Based Web Content Extraction and creates an instant online book view from the links relevant to a given syllabus. This work makes three contributions. First, because syllabus-based content requires educational information in multiple formats, the technique makes such content easier to find. Second, it uses a new approach for capturing and recording the heuristics that experts apply while searching. Third, it exploits the grouping of syllabus words for precise extraction. This paper introduces SBWCE and presents the related details.

Similar resources

An automated syllabus digital library system for higher education in Ireland

Purpose – With the significant growth in electronic education materials such as syllabus documents and lecture notes, available on the internet and intranets, there is a need for robust central repositories of such materials to allow both educators and learners to conveniently share, search and access them. The purpose of this paper is to report on the work to develop a national repository for ...

Using Automatic Metadata Extraction to Build a Structured Syllabus Repository

Syllabi are important documents created by instructors for students. Students use syllabi to find information and to prepare for class. Instructors often need to find similar syllabi from other instructors to prepare new courses or to improve their old courses. Thus, gathering syllabi that are freely available, and creating useful services on top of the collection, will yield a digital library ...

A Pattern-based Annotation Approach: an Ontology-driven Rote Extractor for Pattern Disambiguation

By Sheng Yin (under the direction of Ismailcem Budak Arpinar). One difficulty that prevents a machine from searching, retrieving and processing web content through the World Wide Web (WWW) is that most web content is presented in natural language, which cannot be processed by a machine. The current pattern-based annotation approaches can generate pa...

Concept Extractor - Ein flexibler und domänenspezifischer Web Service zur Beschlagwortung von Texten

We describe a flexible and modular system for keyword extraction and attribution which operates on top of a text mining engine. Texts are analysed in comparison with a large reference corpus and key words are determined using a frequency based method for determining relative term significance. Additionally, selected terms may be expanded using large knowledge bases on inflected forms, orthograp...

Location-based Web Search

In recent years, the relation of Web information to a physical location has gained much attention. However, Web content today often carries only an implicit relation to a location. In this paper, we present a novel location-based search engine that automatically derives spatial context from unstructured Web resources and allows for location-based search: Our focused crawler applies heuristics t...


Publication date: 2008